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PECTINOPHORA GOSSYPIELLA (PINK BOLLWORM) 
BACILLUS THURINGIENSIS TOXIN RECEPTOR BT-R 2 

CROSS-REFERENCES TO RELATED APPLICATIONS 

This Application for Patent claims the benefit of 
priority from, and hereby incorporates by reference the 
entire disclosure of, co-pending U.S. Provisional 
Application for Patent Serial No. 60/161,564 filed 
October 26, 1999. 

TECHNICAL FIELD OF THE INVENTION 

This invention generally relates to receptors for 
Bacillus thuringiensis (BT) toxin and thus to 
pesticides able to bind the receptor, and to 
ameliorating pesticide resistance. In particular, the 
invention relates to recombinant DNA and expression 
systems for a novel receptor and receptor elements from 
Pectinophora gossypiella, the pink bollworm. 

BACKGROUND OF THE INVENTION 

Without limiting the scope of the invention, its 
background is described in connection with uses of 
Bacillus thuringiensis toxins as cotton insect biocidal 
agents, as an example. Cotton insect pests reduced 
yields by almost 10% across the US in 1998. Insect 
damage reduced the overall cot-ton yield by more than 
1.7 million bales and produced a financial loss of 
about $1,224 billion. One group in particular, the 
bollworm/budworm complex was the most damaging causing 
a 2.7% loss. The pink. bollworm, Pectinophora 

gossypiella Saunders ( " PBW" ) , is a lepidopteran insect 
that causes severe damage to cotton and is the most 
destructive pest of cotton worldwide. 
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Bacillus thuringiensis is a gram positive, 
sporerorming bacterium that forms a parasporal crystal 
which contains insecticidal toxins (Bulla et al . , Crit. 
Rev. Microbiol. (1980) 8: 147-204; Hofte and Whiteley, 
Microbiol. Rev. (1989) 53: 242. The effect of the 
toxin is mediated through binding to specific receptors 
on the apical brush border of the midgut microvillae 
(B3MV) of susceptible insects. 

Biological control of cotton pests using B. 
thuringiensis formulations and transgenic plants has 
been in use for a number of years and is growing 
rapidly. Recently, transgenic cotton plants carrying 
the toxin genes of BT have been developed and sold 
commercially. Such transgenic plants have a high 
degree of resistance to the pink boilworm (Schnepf et 
al . , Microbiol. Mol . Biol. Rev. (1998) 62: 775). 
However, the introduction of any new insecticide into 
a pest management program immediately initiates a 
selection process for individuals that are resistant to 
the pesticide. As the use of transgenic crops 
expressing BT toxin increases, insect resistance is 
expected to become more widespread. Increased 
tolerance for BT toxins in several species of insects 
has been reported by several investigators while 
laboratory selection experiments have shown that the 
use of BT toxin formulations and transgenic plants can 
provoke the development of resistance in the pink 
boilworm (Bartlett, et al . , Beltwide Cotton Conference 
(1995) 2 : 766) . 

Concerns that BT toxin formulations or transgenic 
plants expressing the toxin genes may evoke emergence 
of either resistant or tolerant strains of insects has 
made the search for a better understanding of the 
interaction between the BT toxin proteins and their 
respective insect receptors a matter of considerable 
economic importance . 
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In U.S. Patent No. 5,693,491, the present 
inventors disclosed the purification and cDNA cloning 
of a B . thuringiensis toxin receptor BT-Ri from larvae 
of the tobacco hornworm Manduca sexta (M. Sexta) . 
5 Recently, two BT toxin receptors have been identified, 

purified and cloned from the silkworm, Bombyx mori 
(Nagamatsu et al . , Biosci . Biotechnol . Biochem. (1998) 
62 : 727) . 

Heretofore in this field, there has been no 
10 structural information concerning the structure and 

function of BT toxin receptor of the major cotton 
insect pest, P. gossyqyiella . Furthermore, to the 
inventors 1 knowledge, the minimum binding fragment 
encoding a consensus binding domain for BT toxin on the 
15 BT receptor has not yet been identified. Isolation of 

the minimum binding fragment could permit cloning and 
structural characterization of important yet 
uncharacterized BT toxin receptors from other insects 
of worldwide economic importance such as P . 
2 0 gossyipiella . 

SUMMARY OF THE INVENTION 

The present invention provides information and 
materials for isolation and expression of novel BT 
crystal toxin receptors, herein referred to as Cry 
25 toxin receptors. Generally, the invention provides 

structural and functional characterization of a novel 
lepidopteran BT toxin, receptor, herein referred to as 
BT-R,. 

A cDNA that encodes an alternative glycoprotein 
30 receptor from the pink .bollworm that binds specifically 

to a B. thuringiensis toxin has been cloned, sequenced 
and characterized. The BT-R, cDNA permits the analysis 
of receotors in pink bollworm and other insects and 
organisms that affect crop growth and development, as 
35 well as the design of assays for the cytotoxicity and 
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binding affinity of potential pesticides. The clone 
and other methods described herein, permit the 
manipulation of natural and/or introduced homologous 
receptors and, thus, to specifically destroy organisms, 
tissues and/or cells of the target host, including 
insects resistant to toxins of B . thuringiensis . 

The invention further provides purified and cloned 
cDNA encoding a 200 kD receptor for the CrylA toxins of 
the pink bollworm, P. gossypiella. An advantage of 
this invention is the identification of the minimum 
binding fragment encoding the toxin binding domain on 
the BT toxin receptor. Another advantage of this 
invention is the provision of methodologies for cloning 
and structural characterization of presently unknown BT 
receptors. Furthermore, this invention provides 
methods and materials for identification and design of 
effective toxin binding receptors for use in combating 
emergence of toxin resistance. Also, this invention 
may be used to generate transgenic organisms expressing 
toxin receptors. 

BRIEF DESCRIPTION OF THE DRAWINGS 

A more complete understanding of the method and 
apparatus of the present invention may be obtained by 
reference to the following Detailed Description when 
taken in conjunction with the accompanying Drawings 
wherein : 

FIGURES 1A-B show the nucleotide sequence cDNA 
encoding the BT-R 2 protein from P . gossypiella (SEQ ID 
NO:l) ; 

FIGURES 2A and 2B show the amino acid sequence of 
BT-R2 protein from P. gossypiella (SEQ ID NO: 2) . 
Arrows indicate the start site of the putative cadherin 
domains CR1 - CR12 , SIG = signal sequence (double 
underline) ; MPD = membrane proximal domain; CYT = 
cytoplasmic region. The transmembrane region is 
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underlined and bold. The leucine zipper motif LZ is 
underlined. N residues denote putative N-glycosylation 
sites. The minimum binding fragment MBF (aa 1269-1367) 
is also double underlined; 

Figure 3A is a graph showing the binding results 
of CrylA toxins on P . gossypiella larvae brush border 
membrane vesicles prepared from midgut epithelial 
cells ; 

Figure 3B is a graph showing the toxicity results 
of CrylA toxins on P. gossypiella larvae and BBMV; 

Figure 4 is a map of the structure of the pink 
bollworm (PBW) BT-R, cDNAs , including truncations PBW- 
1210-1439, PBW-1269-1439, PBW- 1 3 67 - 14 96 , and PBW-1210- 
1367 (the minimum binding fragment) . The binding of 
proteins expressed from each clone to CrylA toxin was 
identified by (+) for binding and {-) for non-binding ; 
and 

FIGURES 5A-C illustrate an alignment of the silk 
worm (tOD) , the tobacco hornworm (middle) , and the pink 
bollworm (bottom) Cry toxin receptors. Perfectly 
conserved residues are boxed. 

DETAILED DESCRIPTION OF THE PRESENTLY PREFERRED 
EXEMPLARY EMBODIMENTS 

The present invention will now be described more 
fully hereinafter with reference to the accompanying 
drawings, in which preferred embodiments of the 
invention are shown. This invention may, however, be 
embodied in many different forms and should not be 
construed as limited to the embodiments set forth 
herein; rather, these embodiments are provided so that 
this disclosure will be thorough and complete, and will 
fully convey the scope of the invention to those 
skilled in the art. 
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Abbreviations and Definitions 

The following abbreviations are used throughout 
this application: bp - base pairs; BT - Bacillus 
thuringiansis or B. thuringiensis ; BT-R X - BT toxin 
receptor of type x; BBMV - brush border of the membrane 
vesicles; cDNA - complementary DNA; Cry toxin - 
parasporal crystalline toxin of BT; IEF 

Immunoelectrophoresis; kb - kilobase or kilo base 
pairs; kD - kilodaltons; K d - dissociation constant; 
LC 5 n - lethal concentration resulting in a 50% 
mortality; PBW - pink bollworm, Pectinophora 

gossypiella or P. gossypiella ; PCR - polymerase chain 
reaction; RACE - Rapid Amplification of cDNA Ends; RT 
- reverse transcriptase; SW - silkworm {Bombyx mori or 
B. mori); THW - tobacco hornworm (Manduca sexta or M, 
sexza) ; and UTR - untranslated region. 

The term "x% homology" refers to the extent to 
which two nucleic acid or protein sequences are 
identical as determined by BLAST homology alignment as 
described by T . A . Tatusova & T.L. Madden (1999), "Blast 
2 sequences - a new tool for comparing protein and 
nucleotide sequences", FEMS Microbiol Lett. 174:247-250 
and using the following parameters: Program (blastn) or 
(blastp) as appropriate; matrix (OBLOSUM62), reward for 
match penalty for mismatch (-2); open gap (5) and 

extension gap (2) penalties; gap x- drop off (50); 
Expect (10); word size (11); filter (off). An example 
of a web based two sequence alignment program using 
these parameters is found at 

http: //www.ncbi .nlm.nih.gov/gorf/bl2 .html . 

The invention thus includes nucleic acid or 
protein sequences that are highly similar to the 
sequences of the present invention, and include 
sequences of 80, 85, 90, 95 and 98% similarity to the 
sequences described herein. 
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The invention also includes nucleic acid sequences 
that can be isolated from genomic or cDNA libraries or 
prepared synthetically, and that hybridize under high 
stringency to the entire length of a 400 nucleotide 
5 probe derived from the nucleic acid sequences described 

herein under. High stringency is defined as including 
a final wash of 0 . 2X SSC at a temperature of 60°C. 
Under the calculation: 

Eff Tm = 81.5 + 16.6(log M [Na+] ) + 0.41(%G+C) - 
10 0.72 (% formamide) 

the percentage allowable mismatch of a gene with 50% GC 
under these conditions is estimated to be about 12%. 



The nucleic acid and protein sequences described 
herein are listed for convenience as follows: 



SEQ ID 
Nos . : 


DNA and Protein Sequences 


SEQ ID NO: 


BT-Ro cDNA sequence from P. gossypiella 
(Figure 1) 


SEQ ID NO: 

2 


BT-R 2 protein sequence for P. 
gossypiella (Figure 2) 



25 



SEQ ID 
NOS . : 


Primer Sequences 


Primer 
Name 


SEQ ID 
NO: 3 


5 ' CAN ATH CGN GCN CAN GAY 
GGN GG 3 ' 


BTR 1209U 


SEQ ID 
NO: 4 


5 * TTG TAC ACS GCW GGS ATW 
TCC AC 3' 


BTR 1355U 


SEQ ID 
NO: 5 


5 ' NAC YTG RTC RAT RTT RCA 
NGT CAT 3 1 


BTR 1486D 


SEQ ID 
NO: 6 


5 ' NCC DAT NAG RTC NGA RTC 
RTT NGA 3 ' 


BTR 1657D 


SEQ ID 
NO: 7 


5' TAG GTT GTA TCC TCA GTA 
TGA GGA 3 ' 


PBW-BTR 
GSP-1 


SEQ ID 
NO: 8 


'5' CCA GAG TGG AGT CCA CCG 
CCA TA 3 ' 


PBW-BTR 
GSP-2 
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SEQ ID 
NO: 9 


5 ' CTG AGT AAG TGT TAT CTT 
GAA AG 3 ' 


PBW-BTR 
GSP-3 


SEQ ID 
NO: 10 


5 ' CAN ATH CGN GCN CAN GAY 
GGN GG 3 ' 


BTR 1209U 


SEQ ID 
NO: 11 


5 ' GAT AGC GGC CCC AGG AAC 
CAA CAA ACA GG 3 ' 


PBW-BTR 
GSP-4 


SEQ ID 
NO: 12 


5 1 AGT GCG AGT GCT TTG AAT 
CTG TGA 3 » 


PBW-B ■ IR 
P2U 


SEQ ID 
NO: 13 


5 1 GTC TCT TCT CAC CGT CAC 
TGT CAC T 3 ' 


PBW-BTR 
P5U 


SEQ ID 
NO: 14 


5 ' GCA TGC TGG CAG TAG GTT 
GTA TC 3 1 


PBW-BTR 
P6D 


SEQ ID 
NO: 15 


5« GGC CAC GCG TCG ACT AGT AC 
3 ' 


(AUAP) 


SEQ ID 
NO: 16 


5 ' GGC CAC GCG TCG ACT AGT 
^^t-p r j 1f j ir p TTT TTT TT 1- ^ -1 'ipT" - ^ - ""P 3 ' 


(AP) 


N = A, C, T, or G; H = A, T, or C; B = T, C, or G; D 
= A, T, or G; V = A, C, or G; R = A or G; Y = C or T; 
M - A or C; K = T or G; S = C or G; W - A or T 



20 More particularly, the studies described herein 

were targeted toward the identification, cloning and 
characterization of novel Cry toxin receptors. One 
embodiment was directed to characterization and 
isolation of the heretofore unidentified Cry toxin 

25 receptor of the pink bollworm, P. gossypiella , 

hereinafter referred to as "PBW". 

In order to identify and isolate the Cry toxin 
receptor of the PBW, toxicity was determined for five 
different Cry proteins (CrylAa, CrylAb, CrylAc, Cry3A 

30 and CryllA) against neonate PBW larvae. It was 

determined that the lepidopteran- specif ic toxins 
(CrylAa, CrylAb and CrylAc) showed high toxicity toward 
P3W larvae with a LC 5C ranging from 25-45 ng/cnr of 
insect diet, while the coleopteran specific (Cry3A) or 

35 the dipteran specific (CryllA) toxins did not exhibit 

any detectable toxicity up to 2000 ng/cnr (Figure 3) . 
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The binding of the three lepidopteran-specif ic 
CrylA toxins ( CrylAa ; CrylAb and CrylAc) to the BBMV of 
P. gossypiella was characterized in detail. Ligand 
blot experiments showed that proteins of 120 kD bind 
5 only the CrylAc toxin whereas a 200 kD protein binds to 

CrylAa, CrylAb and CrylAc toxins. It is now known that 
the 120 kD protein is a heat shock protein, although 
its relation to the Cry toxin effect is not understood. 
In the case of the 175 kD cadherin- like CrylAa 

10 binding protein from Bowbyx mori , ' 2zi l -labeled CrylAa 

binding was eliminated by the presence of unlabeled 
CrylAa, but additional band(s) of approximately 110 kD, 
identified by 125 T-CrylAa ligand blots, failed to 
demonstrate a detectable degree of competition. Thus, 

15 it was determined that P. gossypiella, like M. sexta 

and B. mori , contains both high-affinity and low- 
affinity binding proteins for at least one CrylA toxin 
and that the 200 kDa protein from PBW is a common 
binding protein for the lepidopteran-specif ic CrylA 

20 toxins. 

The detailed mechanism of the CrylA toxin 
interaction with the midgut BBMV of the pink boll worm 
was determined. The equilibrium dissociation constants 
(K d ) calculated from the homologous competition assays 

25 (Figs. 3A and 3B) are 16.5, 12.4 and 12.8 nM and the 

concentrations of binding sites are 3.7, 3.6 and 8.6 
pmol/mg, for CrylAa, CrylAb and CrylAc, respectively. 
The Hill Coefficients for the three CrylA toxins are 
between 0.6 and 0.8 for BBMV binding proteins (Figure 

30 3A) , indicating that there is negative cooperat ivi ty in 

the binding of these toxins to the binding site(s) in 
the BBMV. Binding of the CrylA toxins to BBMV proteins 
was specific and saturable. The toxin amount required 
for saturation of 460 /ig of BBMV proteins was in the 

35 following order: CrylAoCry 1 Aa >Cry lAb . 
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Immunoprecipitation of BBMV proteins with anti- 
CrylAb antiserum and subsequent ligand blotting with 

-or, 

"■'I -CrylAb toxin also showed binding of the toxin to an 
approximately 200 kD protein. The 200 kD protein is a 
single protein as shown by 2D-gel analysis (data not 
shown) . A comparison between the 210 kD binding 
protein from M . sexta with a pi ~ 4.3 and the 200 kD 
binding protein from P. gossypiella (pi - 4.1) revealed 
that both proteins have almost the same pi. it was 
determined that the 200 kD PBW protein had some cross- 
reactivity with polyclonal antisera against the M. 
sexta BT-R- 210 kD protein. 

In order to clone the PBW 3T-R, gene, fully 
degenerate primers were designed based on the conserved 
amino acid sequences between that of the two receptors, 
tobacco hornworm ("THW") BT-R, and silkworm ("SW") BT- 
R175. The primer locations were designed to include or 
exclude a sequence thought by the present inventors to 
encode a region in the extracellular domain critical to 
toxin binding, herein after "READ" signature sequence. 
Hereinafter this binding fragment of the DNA sequence 
will be referred to as the "signature" region. 

Three clones were obtained, PBW-421 (aa 1367- 
1496), PBW-866 (aa 1210-1496) and PBW-1373 (aa 1210- 
1675) , which have about 50 % nucleotide and about 60% 
amino acid sequence similarity to both THW BT-R, and SW 
BT-R175. The 421 bp and 866 bp clones encode proteins 
of about 21 and 32 kD, respectively. Although both 
expressed proteins cross-reacted with THW BT-R, 
polyclonal antisera, the 32 kD protein, but not the 21 
kD protein, was shown to bind CrylAb toxin specifically 
with high affinity. The estimated K d value is about 
17 nM, which is similar to the K d value obtained for 
3BMV . Similarly, an internal fragment from the PBW- 
866 clone did not bind toxin, but did cross-react with 
BT-Rj antibodies. This data demonstrates that 
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recognition by anti-BT-R : antibodies is insufficient to 
define a functional toxin receptor. 

In order to obtain a cDNA sequence encoding the 
full-length receptor, the 5' and 3' ends of the PBW BT- 
5 R 2 receptor were first obtained using 5 ' and 3 1 RACE 

reactions followed by cloning of the full-length 
receptor cDNA using gene specific primers from the 5' 
and 3 1 UTR. The full-length" cDNA clone (SEQ ID NO: 1) 
has an open reading frame of 1729 amino acids (SEQ ID 

10 NO:2), with a deduced molecular weight of 194 kD and a 

calculated pi value of 4.1, which is similar to the 
value determined by 2-D gel analysis. 

The protein consists of three domains: 
extracellular, transmembrane and cytoplasmic. The 

15 protein sequence contains two hydrophobic regions, one 

at the amino terminus, characteristic of a signal 
peptide and one near the COOH- terminus (amino acids 
1575-1600) that probably forms a transmembrane domain. 
The extracellular domain contains 12 cadherin- like 

20 motifs, in addition to, a membrane proximal region that 

contains two leucine zipper motifs. Eleven consensus 
sites for N-linked glycosylat ion are present in the 
extracellular region, which may account for the 
difference in apparent molecular mass between the 

25 native protein and the calculated mass. 

Based on the results discussed above, it would be 
apparent to one of ordinary skill in the art that 
variances in receptor sequences or in toxin binding 
affinities or in receptor expression may render 

30 different levels of toxin susceptibility or resistance. 

Furthermore, the receptor of the present invention may 
be used to generate transgenic organisms by methods 
well known in the art. 

To investigate the mode of action of BT toxin, a 

35 mammalian heterologous cell culture system was chosen 

for several reasons. First, BT CrylA toxins have shown 
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no toxic effect on any mammalian cell lines studied to 
date. This characteristic is in contrast to most 
available insect cell lines, which exhibit variable 
degrees of sensitivity to toxin (Kwa et al . , 1998). 
Second, the use of a mammalian cell would allow the 
determination of whether the receptor, independent of 
any associated protein in an insect cell line, would 
mediate toxicity. 

When introduced into mammalian COS- 7 cells, the 
cloned cDNA expressed BT-R 2 that was detected by 
western blot analysis using BT-R : antisera. The 
expressed receptor was displayed on the cell surface 
and detected with polyclonal antibodies raised against 
M. sexta BT-R;. These results suggest that the protein 
expressed by the PBW BT-R Z cDNA is similar to the 
natural protein found in the insect midgut. 

The possibility of using COS-7 mammalian cells 
transfected with a receptor for BT toxins as a model 
system for assessing the cytotoxicity of the CrylA 
toxin was determined. The surface receptor clearly was 
able to bind to the CrylAb toxin, which was detected 
by immunof luorescent labeling using CrylAb antibodies 
(data not shown) . These results indicate that the 
binding site of the receptor must assume its native 
conformation. Significantly, intensively labeled 
vesicles in the methanol fixed transfected COS-7 cells 
were observed when the cells were incubated with BT-Ri 
antiserum (data not shown) . This observation indicates 
that vesicles, which form normally in the cell 
endocytosis/exocytosis pathway, contain the BT-R 2 
proteins. In addition, this result shows that the 
receptor is not only expressed on the cell surface, 
like its native counter part in the insect midgut, but 
also is recycled normally by the cell. 

Microscopy of the transfected COS-7 cells treated 
with CrylAb toxins for various times demonstrated 
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significant cytopathological patterns. The 
cytopathological changes observed under the fluorescent 
microscope included disruption of the plasma membrane, 
cell swelling, disintegration and death of the cells. 
5 The symptoms were obtained in the presence of 0.6 /ig/ml 

CrylAb for 2 hr. In contrast, no cytopathological 
effects were revealed for cells transfected with vector 
alone and subsequently treated with toxin. Clearly, 
there is a distinct correlation between toxin binding 

10 to the surface receptor and toxicity to the cells. 

The cytological appearance and ul trastructure of 
the midgut cells of M. sexta and other lepidopteran 
larvae, after intoxication with preparations of BT, 
have been reported extensively by several authors 

15 (Bravo et al . , 1992) , Histopathological studies on M. 

sexta midgut demonstrated pathological behavior for 
CrylA on midgut epithelial cells (columnar cells) 
(Midhoe et al . , 1999). These investigators 

demonstrated that the epithelial cells of the midgut 

20 swell shortly after ingestion of the BT toxin. 

Eventually, the epithelial cells burst and released 
their cytoplasmic contents into the midgut lumen. 

The present observations on the intoxicated 
transfected COS-7 cells are in complete agreement with 

25 these reports, which demonstrates that the toxin acts 

similarly in both systems. Furthermore, it should be 
apparent to one of ordinary skill in the art that cells 
expressing transfected molecules of the BT toxin 
receptor as well as cells expressing a natural form of 

30 the receptor may be used to asses the level of 

cytotoxicity and mode of action of toxins. 

Lepidopteran insects generally express high 
molecular weight binding proteins for the CrylA toxins 
that range in size from 160 to 220 kD {Mart mez -Ramirez 

35 1994; Vadlamudi et al.; 1993, Oddouet al . , 1993; 

Nagamatsu et al . , 1998a; Ihara et al., 1998). Two of 
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these proteins, in addition to the 200 kD pink bollworm 
receptor, have been cloned and sequenced: the BT-R-. 210 
kD cadherin-related receptor from M. sexta (Vadlamudi 
et al., 1995) and the 175. kD cadherin-related from B. 
mori (Nagamatsu et al . , 1998a). Interestingly, these 
two proteins have 60-70% identity and 80% similarity 
between themselves . 

P. gossypiella expresses a high-affinity and a 
low-affinity binding protein for at least one CrylA 
toxin, CrylAc. The high-affinity receptor is a 
cadherin-related protein with a large molecular mass. 
One of the most important conserved regions may be the 
signature sequence. The signature sequence contains 
the sequence (READ) , which is believed to be 
responsible for toxin binding due to the presence of 
two negatively charged amino acids that bind to two 
arginines in the toxin binding site. Supporting 
evidence comes from the immunoblot analysis for clones 
PBW-866, which contains the proposed signature 
sequence, and PBW-421, which does not include the 
signature sequence. To further define the minimum 
binding fragment, truncation peptides were tested for 
their ability to bind toxin (Figure 4). The minimum 
binding fragment contains the "READ" signature sequence 
and consists of amino acids 1269 to 1367. 

The information provided herein is necessary for 
understanding the molecular biology of the toxin 
receptor in the pink bollworm and to engineer more 
effective toxins in terms of longer persistence in the 
field, higher toxicity, and preclusion of resistance 
development. This information will facilitate 

understanding of Cry toxin receptor interactions in 
other economically important insect crop pests. 



14 



WO 01/34807 



PCT/US00/41521 



Example 1 Specificity of Purified Toxins 

Recombinant protoxins CrylAa, CrylAb, and CrylAc 
(Bacillus Genetic Stock Center, Ohio State University) 
were prepared from E . coli JM-103 and trypsinized 
5 essentially as described by Lee et al . J. Biol. Chem. 

(1992) 267: 3115. In addition, the soluble trypsinized 
60 kD toxins were subjected to FPLC NaCl salt gradient 
purification over an HR-5/5 Mono-Q anion exchange 
column (PHARMACIA™) prior to quantitation, radio- 

10 iodination, and use in bioassays. Cry3A crystal 

protein from B . thuringiensis subsp. tenebrionis was 
solubilized in 3 . 3 M NaBr and treated with papain, and 
the resulting 67 kD toxin was purified by the method of 
Li et al. Nature (1991) 353: 815. The 65 kD CryllA 

15 toxin was isolated from B. thuringiensis subsp . 

israelensis via solubilization as described by Chilcott 
et al. J. Gen. Micro (1988) 134: 1551 and further 
purified by anion- exchange FPLC. All toxin protein 
quant illations were performed using the bicinchoninic 

20 acid method (PIERCE CHEMICAL™ ) with Bovine Serum 

Albumin (BSA, Fraction V) as a standard. 

Pink bollworms were obtained from the USDA PINK 
BOLL WORM REARING FACILITY™ ( PBWRF , Phoenix, AZ) . An 
artificial diet was obtained from SOUTHLAND PRODUCTS 

25 INC.™, Lake Village, AR. The diet was reconstituted 

in boiling water and cooled to 55°C. Each Cry toxin 
was thoroughly mixed in the warm liquid diet and 
bioassay cups were filled with 20 ml of diet. After 
cooling and drying, 10 neonate larvae were placed in 

30 each cup and the cups were immediately capped. The 

method of Watson, et al . , Seltwide Cotton Conference, 
Memphis, Tenn . (1995) was used to determine the 
toxicity of trypsin-activated toxins against first- 
instar larvae of P. gossyqpiel la . Generally, four 

35 replicates of six cups were prepared for each dose. 

Cups were incubated at 30° C for 21 days, the length of 
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time necessary for more than 95% of normal P. 
gossypiella to reach pupation. At the end of 21 days, 
the diet cups were examined and the numbers of larvae 
and numbers of pupae or adults in each cup were 
recorded . 

The specific toxicities of purified CrylAa, 
CrylAb, CrylAc, Cry3A and Cry 2A tested using neonate 
P. gossypiella larvae are shown in Figure 3B. It was 
determined that all three CrylA toxins are highly 
toxic, with LC:, S values ranging from 25-45 ng/cm 3 of 
artificial diet. Cry3A (considered toxic to 

coleopteran or beetle insects) and Cry IIA (considered 
toxic to dipteran insects, especially mosquitoes) were 
not toxic to P. gossypiella larvae at the highest 
concentrations tested (2000 ng/cm-') . 

Example 2 Characterization of the BT-R 2 Receptor 

Early fourth- instar larvae were kept on ice for 1 
hr and midguts were surgically removed from the larvae. 
BBMW were prepared from midgut tissues by the 
20 differential magnesium precipitation method of 

Wolf ersberger , et al . , Comp. Biochem. Physiol. (1987) 
86A: 30, in the presence of protease inhibitors (5 
mg/ml pepstatin, antipain, aprotonin, leupeptin, 1 mM 
PMSF, and 5 mM benzamidine) . The final pellet was 
25 resuspended in buffer A (300 mM mannitol, 5 mM EGTA, 

and 17 mM Tris-HCl, pH 7.5) containing the protease 
inhibitors, flash frozen in liquid nitrogen, and stored 
at -85°C. 

Cry toxins were radioiodinated using the 
30 chloramine T method (Hunter and Greenwood, Nature 

(1962) 194: 495, with 125 I-Na (NEN DUPONT ™) . Ten fig 
of toxin were mixed with 5 fil of m I-Na (0.5 mCi) in 
100 ill of NaHP0 4 buffer (0.5 M, pH 7.4) with 25 /xl of 
Chloramine T (4 mg/ml). The reaction mixture was - 
35 agitated for 20-25 seconds at 23°C and the reaction was 
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stopped by adding 50 fil of Na 2 S 2 0 5 (4.4 mg/ml). Free 
iodine was removed by gel filtration on an EXCELLULOSE 
™ desalting column (PIERCE ™) equilibrated with PBS 
containing 10 mg/ml BSA . 



5 Toxin Binding Assays . 

Both homologous and heterologous competition 
inhibition binding assays were performed as described 
by Keeton and Bulla (1997) . A total of 25 fig of BBMV 
were incubated with 1.2 nM 11>L I-CrylAc toxin in the 

10 presence of increasing concentrations (0-1000 nM) of 

the appropriate unlabeled homologous toxin (CrylAc) or 
heterologous toxins (CrylAa, CrylAb, Cry3A, and 
CryllA) . Incubations were in 100 fil of binding buffer 
(P3S/0.2% BSA) at 25°C for 30 mm. Radiolabeled and 

15 unlabeled toxins were mixed together before adding them 

to the BBMV. Unbound toxins were separated from 33MV- 
bcund toxin by centrif ugat ion at 14,000 x g for 10 min. 
The pellet containing bound toxin was washed three 
times in ice cold binding buffer by gentle vortexing 

20 and radioactivity in the final pellet was measured 

using a BECKMAN GAMMA 5500 ™ counter. Binding data 
were analyzed by the PRISM ™ program (GRAPH PAD 
SOFTWARE INC.™, San Diego) . 

Competition inhibition binding of l2z I -CrylAc 

25 toxin to P. gossypiella was carried out in the presence 

of increasing concentrations of unlabeled CrylAc, 
CrylAb, CrylAa, Cry3A and CryllA toxins. Homologous 
competition binding assays were performed with 
iodmated CrylA toxins and various concentrations of 

30 the corresponding unlabeled toxin. The binding site 

concentration (B., ax ) , and dissociation constant (K d ) of 
labeled toxins were calculated from three separate 
experiments. The equilibrium binding parameters were 
estimated by analyzing the data with the PRISM™ 

3 5 computer program. 
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Radioligand Blotting. 

The two hundred jug of BBMW proteins were 
solubilized, separated by 7.5% SDS-PAGE and transferred 
to polyvinylidene difluoride (PVDF) membrane as 
5 described by Francis and Bulla (1997) . Blots were 

blocked with TBS (10 mM Tris-HCI and 0.9 % NaCl) 
containing 5% non-fat dry milk powder, 5% glycerol 0.5% 
Tween-20, and 0.025% sodium azide for 2 hr at 25°C. 
Blocking buffer was removed and membranes were 

10 incubated for 2 hr at 25°C in an equal volume of fresh 

blocking buffer containing 2 x 10" cpm/ml (1-1.25 nM) 
of - jr, I-CrylA toxins either in the presence or absence 
of unlabeled toxins. Finally, membranes were washed 
three times with fresh blocking buffer for 10 min each, 

15 rinsed once with TBS, dried, and exposed to Kodak X-ray 

film at -80° C. 

To determine the specificity of binding to the 200 
and 120 kD proteins, blots of P3W 3BMV proteins was 
incubated with :2 "l-CrylAc toxin in the presence of 

20 increasing concentrations of unlabeled CrylAc toxin. 

IMMUNOPRECIPITATION OF CRYlAB BINDING PROTEIN. 

Immunoprecipitat ion was carried out according to 
Vadlamudi, et al. (1993). Twenty five fxl of Cry lAb 
antiserum were added to 1 ml of protein A-Sepharose CL- 

25 4B equilibrated in washing buffer (1% Nonidet P-40, 6 

mM EDTA, 50 mM Tris-HCI and 250 mM NaCl) and mixed for 
1 hr at 4°C. After washing the blot three times with 
washing buffer, 700 fig of Cry lAb toxin were added and 
the mixture were incubated for an additional 1 hr at 

30 4°C and washed again three times with washing buffer. 

Pink bollworm 3BMV proteins (6 mg) were solubilized in 
washing buffer containing 1% NP-40 and protease 
inhibitors (10 ^g/ml pepstatin, antipain, aprotonin and 
leupeptin; 5 mM iodoacetamide ; and 1 mM PMSF) . 
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Unsolubilized proteins were removed by centrif ugat ion . 
Solubilized proteins were filtered through a 0.45 fim 
filter, added to 1 ml of Sepharose-protein A beads 
linked to CrylAb toxins, and the sample was stirred 
5 gently for 1 hr at 4°C. Sepharose beads were 

centrifuged and washed four times with washing buffer 
containing 0.25% NP-40 and 0.02% SDS . The toxin- 
binding protein complex was dissociated by heating in 
Laemmli (1970) sample buffer and the binding proteins 
10 were Coomassie stained and detected by ligand blotting 

with 123 I -CrylAb and Western blot using CrylAb 
antiserum. 

Immunodetection of Pink Bollworm CryIA Receptor. 

Immunoprecipitated proteins were transferred to a 
15 PVDF membrane, blocked with 5% nonfat dry milk in PBS 

buffer and incubated at 4°C overnight in the same 
blocking buffer containing 10 ag/ml of CrylAb. 
Unbound toxin was washed with PBS. Antibodies raised 
in rabbits against the 60 kD CrylAb toxin were diluted 
20 1:1000 and hybridized to the membrane for 2 hr at 25°C 

and the blot then was washed with PBS. Peroxidase- 
conjugated goat anti-rabbit IgG was diluted 1:3000 in 
TBS blocking buffer and hybridized to the membrane for 
2 hr. The membrane then was washed extensively with 
25 PBS. Visualization of the bound toxin was accomplished 

using the Enhanced Chemiluminescence (ECL) Western 
blotting detection method (AMERSHAM™) . 



Southern Blot Analysis. 

Forty fig of PvuH digested genomic DMA from P. 
30 gossypiella or M. sexta were separated on a 0.8 % IX 

TBE-agarose gel and blotted onto a nylon membrane (BIO- 
RAD™, ZETA- PROBE GT™) . The analysis was carried out 
according to Sambrook, et al . Molecular Cloning: A 
Laboratory Manual, 2 nd Ed. Cold Spring Harbor 
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Laboratory, N. Y . (1989). The filter was hybridized 
with 3 -p-labeled, random primed, C-terminal of BT-R, 
cDNA (HincH fragment, 0.5 kb) . Filter hybridization 
was carried out at 42°C for 21 hr in 50 % formamide, 5X 
Denhardt's reagent, 1M NaCl, 2% SDS, 50 mM Tris-HCl and 
100 fig/ml of salmon sperm DNA . The filter was washed 
with 2X SSC, 0.5% SDS, then with IX SSC, 0.5% SDS, then 
with 0.5X SSC, 0.5% SDS, followed by a fourth wash with 
0.25X SSC, 0.5% SDS. Each wash was for 30 min at 42°C. 
Finally, the filter was rinsed in 2X SSC and exposed to 
Kodak X-ray film at -85°C. 

Electrophoretic Elution of Proteins . 

Electrophoresis was performed in 1 . 5 -mm- thick 
polyacrylamide slab gels using 7.5% acrylamide (pH 
3.0). After SDS-PAGE, proteins were revealed as 
transparent bands with 4 M sodium acetate solution. 
The proteins were excised using a razor blade. 
Proteins in the gel strips were fixed in 50 % (v/v) 
methanol solution for 15 min and equilibrated twice in 
0.125 M Tris-HCl buffer (pH 6.8) and 2% 2- 
mercaptoethanol for an additional 15 min. 
Equilibration of the gel strips in the above buffer 
with 1% (w/v) SDS was performed as described above. 
The equilibrated gel strips were inserted into a 
dialysis tube with a minimum amount of the buffer 
containing SDS (25 mM Tris, 190 mM glycine and 0.1% 
SDS) . Electroelution was carried out essentially as 
described by Findlay (1990) . A horizontal flat-bed 
mini-gel electrophoresis apparatus (BIO- RAD™ ) was used 
for electroelution at 50 V for 12 hr at 4°C. The 
buffer consisted of 25 mm Tris, 190 mM glycine and 0.1% 
SDS (pH 8.3). At the end of electrophoresis, the 
polarity of electrodes was changed for 30 sec to avoid 
adsorption of proteins onto the dialysis tubes. The 
buffer inside the dialysis tubes was collected and the 
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tubes were washed three times with a minimum volume of 
buffer. SDS was dialyzed out and protein was 
concentrated by using a CENTRICON-3 0 micro-concentrator 
(AM I CON) . 



5 Two-dimensional Gel Electrophoresis, 

Two-dimensional gel electrophoresis was performed 
according to the method of O'Farrell (1975). 
Isoelectric focusing was carried out in 2.0 mm (I.D.) 
glass tubes using 2.0% ampholines (pH 3.5-10; 

10 L KB /PHARMACIA™) for 9600 volt-hr. After equilibration 

for 10 min in buffer "O' , tube gels were applied to the 
stacking gels on top of 8% acrylamide (pH 8.0) slab 
gels (14 x 14 cm). SDS slab gel electrophoresis was 
carried out for 4 hr at 12.5 mA. After 

15 electrophoresis, one gel was stained with Coomassie 

blue and the others were transblotted onto PVDF paper 
overnight at 200 mA (Vadlamudi et al . , 1993) . The PVDF 
paper was blocked with powdered milk solution, 
incubated with i2r T-CrylAc or ^M-CrylAb and exposed to 

2 0 X-rav film at -85°C. 



Identification and Recovery of cDNA Encoding BT-R 2 . 

Total RNA was prepared from the midgut tissue of 
fourth instar larvae of the P3W by the guanidinium 
thiocyanate method (Chomczynki et al . Analyt. Biochem. 

25 (1987) 162: 156). Poly (A+) RNA was isolated with the 

POLYATRACT MRNA ISOLATION SYSTEM™ ( PROMEGA™ ) . First 
strand cDNA was synthesized using oligo- (dT) and random 
hexamer primers and reverse transcriptase according to 
standard methodologies and used as the template for 

30 amplification by polymerase chain reaction (PCR) of 

desired mRNAs . Degenerate oligonucleotide primers were 
designed based on the conserved amnio acids between M. 
sexta 3T-R : and B . mori BT-R175. Such primers were 
used to clone partial fragments of P3W BT - R 2 - 
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For cloning of the PBW BT-R-, RT-PCR was employed 
using fully degenerate oligonucleotide primers derived 
from a sequence in the membrane proximal domain 
conserved sequence between M sexta BT-R : and B . mori 
BT-R175. Primers BT-R-1355U and BT-R-1209U against BT- 
R-1486D were applied to PBW cDNA to amplify 421-bp and 
866-bp fragments. The PCR products were resolved on 
1.5% agarose, gel purified, cloned into a TA cloning 
vector ( INVITROGEN™) and transformed into E. coli 
INV~F. The presence and identity of the correct insert 
was confirmed with EcoRl digestion and DNA sequencing. 
The P3W-886 clone was found to contain the nucleotide 
sequence found in clone P3W-421. In addition, primer 
1209U against 1657D was used to clone a 1373-bp 
fragment (PBW-1373) , which represents most of the 
membrane proximal domain and the cytoplasmic domain. 
Clone PBW-287 (aa 1346-1438} is a 287 bp internal 
fragment from 86 6-bp clone and was cloned using gene 
specific primers P5 and P6 . 

Based on the sequence obtained from the partial 
clones, sense and antisense primers were used to clone 
the 3' and 5' ends of the PBW 3T-R 2 clone by the 5' and 
3' RACE system according to the manufacturer's 
instructions (GIBCO 3RL™) . The 5' end was amplified 
using gene-specific antisense primers GSP1, GSP2 and 
GSP3 against ABRIDGED UNIVERSAL AMPLIFICATION PRIMER™ 
(AUAP™) provided in the kit. The 3' end was amplified 
using gene primer GSP4 against AUAP™. The PCR product 
of the predicted size was isolated and subcloned into 
TA cloning vector pCR2 . 1 { INVITROGEN™ ) and transferred 
into E. coli INV~F. For recombinant protein expression 
in E . coli, or COS7 cells, the coding sequences for the 
RT-PCR clones or the full length PBW-BT-R, clone were 
recioned into the pET30 or pcDNA3 . 1 expression vectors 
and -transformed into BL21 (DE3) LysS (NOVAGEN™) or 
COS7 mammalian cells. The E. coli cultures were 



WO 01/34807 



PCT/USOO/41521 



induced using a 1 mM final concentration of IPTG for 3 
hr. 

The full length PBW BT-R 2 (-5.5 kb; see sequence 
in Figure 1 SEQ ID NO:l) was ligated into the mammalian 
5 expression vector pcDNA3 . 1 ( INVITROGEN™ ) and confirmed 

by DNA sequencing. The molecular mass of the deduced 
polypeptide is 194 kD with a pi of 4 . 1 . The receptor 
has an open reading frame of 1729 amino acids (Figure 
2) (SEQ ID NO: 2) . The amino acid sequence contains a 

10 putative signal peptide of 23 amino acid residues, a 

transmembrane domain of 27 residues (aa 1578-1605) and 
a 124-residue cytoplasmic domain. In addition, the 
amino acid sequence contains 12 putative cadherin 
motifs, 11 putative N-glycosylat ion sites and two 

15 leucin zipper motifs at amino acid 1541-1562 and 1578- 

1600. The minimum toxin binding fragment is amino 
acids 1269 to 1367 (Figure 4) . 

When the protein homology is analyzed by BLASTP, 
as described under definitions above, the closest 

20 paralog in the GenBank nonredundant. (nr) database is 

the Bombyx mori receptor at Acc . No. JE0128 with 
Identities - 1034/1708 (60%) , Positives = 1266/1708 
(73%) , Gaps - 35/1708 (2%) . The next closest species 
was Manduca sexta at Acc. No. AAB33758.1 with 

25 Identities - 871/1540 (56%) , Positives = 1101/1540 

(70%) , Gaps = 22/1540 (1%) . The nucleotide sequence 
showed no significant homologies. 

The peptide homologies amongst these three species 
are shown in FIGURES 5A-C where perfectly conserved 

30 residues are boxed. Peptide fragments of the SBW 

sequence may be used to generate specific or 
nonspecific antibodies. Usually, it is recommended 
that at least 17 amino acid peptide fragments are used 
tc generate antibodies, however, smaller peptides may 

35 also be antigenic and sufficiently complex to be 

unique. In particular, the carboxyl tail (aa 1677-end) 
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10 



15 



20 



25 



30 



of the P3W sequence is unique to this species and can 
be used to generate PBW unique antibodies. Exemplary 
peptides that may be useful as antigens (numbered with 
respect to FIGURE 5, SEQ ID NO: 2) are shown as 
follows : 



row unique Peptides 


Common Peptides 


aa - R44 


aa 291-304 


aa 697-705 


aa 622-632 


aa 886-895 


aa 791-803 


aa 1055-1066 


aa 1621-1642 


aa 1321-1331 




aa 1451-1461 




aa 1516-1525 




aa 1572-1582 




aa 1677-1729 





IMMUNODETECTION OF THE EXPRESSED BT-R 2 PROTEINS. 

Cell lysates from the induced BL21 (DE3) LysS 
bacterial cultures were elec trophoresed and 
transferred to PVDF membranes. Filters were blocked at 
4°C in 50 ml of blocking buffer containing 10 ug/ml of 
CrylAb toxin. Unbound toxin was removed by PBS. 
Rabbit primary antibodies for the THW was removed by 
PBS. Rabbit primary antibodies for the THW BT-R, 
extracellular domain or for the FPLC-purif ied CrylAb 
were diluted 1:1000 in 50 ml TBS blocking buffer. The 
filters were incubated for 2 hr with t*ie antiserum and 
washed three times with the blocking buffer. 
Peroxidase-conjugated goat anti-rabbit IgG was diluted 
to 1:2000 and incubated with filters for 2 hr at 27°C 
and was developed with the enhanced chemoluminescence 
(ECL) detection system ( MA E R S KA.M ™ ) . 
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Mammalian Expression of BT-R 2 . 

The PBW BT-R 2 cDNA cloned into pcDNA3 . 1 , a 
mammalian expression vector ( INVITROGEN™) , was 
expressed in mammalian cells (COS-7 SV40 transformed 
African green monkey cells; ATCC CRL-1651) according to 
methods described by Keeton and Bulla, Appl . Environ. 
Microbiol. (1997) 63: 3419. COS-7 cells (4 x 
ICT/well) were grown in Dulbecco's Modified Eagle 
Medium (DMEM) supplemented with 10% heat- inactivated 
fetal bovine serum (FBS) on 12 mm cover slips placed in 
a 24-well plate. 

COS-7 cells were transfected with the construct 
using the LIPOFECTAMIN PLUS REAGENT™ (GIBCO BRL™) . 
The cells were incubated for two days at 37°C in DMEM 
medium containing 10% FBS in a humidified atmosphere of 
10%CO 2 . BT-R;. was monitored by SDS-PAGE and 
immunoblotting with anti-BT-Rj or antiCrylAb antiserum. 
Surface expression was detected by immunofluorescence 
microscopy with the anti-BT-R-. antibodies. The effects 
of 3T toxin on the transfected cells were demonstrated 
by incubating the cells in the presence or absence of 
CrylAb toxin for 2 or 4 hr and monitoring the 
morphological changes by immunofluorescence microscopy 
using either anti-BT-R : or ant i- CrylAb antibodies. 
Cell death is clearly demonstrated (not shown) . 

IMMUNOFLOURESCENCE MICROSCOPY . 

COS-7 cells were grown on 12 -mm glass coverslips 
in a 24-well plate. The cells were fixed and 
permeabilized either in cold methanol (-20°C) or 4% 
paraf ormaldhyde for 15 minutes at 27°C. Coverslips 
were rinsed three times with PBS and then blocked for 
15 minutes with 1% BSA in PBS. Cells were incubated 
with primary antibody for 30 minutes at 27°C followed 
by rinsing and blocking as just described. The same 
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incubation and washing procedures were applied to 
secondary antibody. Antibodies were detected with 
TRITC goat anti- rabbit IgG. Coverslips were mounted in 
FLUROMOUNT G™ and viewed with an OLYMPUS™ microscope 
equipped with epi - fluorescence illumination and a 40X 
Apochromat lens. Photography was done with an OLYMPUS 
SPOT™ camera. 

Western Blot Analysis . 

Transfected COS-7 cells were washed with cold PBS, 
lysed in lysis buffer (50 mM Tris/HCL , 1 mM EDTA, 10 /iM 
leupeptin) and resuspended on ice for 10 minutes. 
Then, 4X sample buffer was added to the cells and 
heated at 95°C for 5 minutes. Lysates were subjected 
to electrophoresis through 7.5% SDS-PAGE, and proteins 
were electrophoret ically transferred to a PVDF filter, 
blocked and incubated with either anti-BT-R 1# or anti- 
CrylAb antibodies. 

Results: Identification of 125 I-Cry1A Binding Proteins. 

BBMV proteins of P. gossyqiiella ranged in 
molecular size from greater than 205 kD to less than 25 
kD (data not shown) as determined by SDS-PAGE. 12!> I- 
labeled CrylAa, CrylAb and CrylAc were used in ligand 
blots to identify which P. gossypiella BBMV proteins 
bind the respective toxins. Proteins that had been 
separated by SDS-PAGE were transferred to PVDF 
membranes and incubated with each radiolabeled- toxin 
separately. 125 I-CrylAa, :2b I-CrylAb and i25 I-CrylAc 
bound to a protein of about 200 kD (data not shown) . 
125 I -CrylAc bound also to a protein band at about 120 
kD. Neither CrylAa nor CrylAb bound to the 12 0 kD 
protein. The binding patterns for all three toxins 
were the same under both reducing and nonreducing 
conditions (data not shown) . 
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Results : Competition Inhibition Binding Assays . 

3 -' 3 I- labeled CrylAa, Cryl Ab and CrylAc were used 
in binding assays with P. gossypiella B3MV. 
Competition binding of 12: "l-CrylAc toxin to P. 
gossypiella was carried out in the presence of 
increasing concentrations of unlabeled CrylAa, CrylAb, 
CrylAc, Cry3A and CryllA toxins. Fifty-percent 
inhibition of CrylAc binding was observed at 10 nM of 
unlabeled CrylAc, 100 nM unlabeled CrylAa and 100 nM of 
unlabeled CrylAb. At a concentration of 1000 nm, 
unlabeled CrylAc, CrylAb and CrylAa reduced binding of 
iodinated CrylAc by 95, 82 and 80%, respectively (data 
not shown) . Neither Cry3A nor CryllA toxin competed 
for the CrylAc toxin binding site. 

Homologous competition binding assays were 
performed with iodinated CrylA toxins and various 
concentrations of the corresponding unlabeled toxin 
CrylAa, CrylAb and CrylAc showed high binding affinity 
to BBW proteins (data not shown) . Fifty-percent 
inhibition of binding of CrylA toxins was observed at 
concentrations of approximately 10 nM of the 
corresponding unlabeled toxin. These data indicate 
that each of the three toxins binds specifically with 
high affinity. The binding site concentration, B max , 
and the dissociation constant, K^, of each toxin was 
calculated from the three separate homologous 
competition inhibition experiments by analyzing the 
data with the GRAPHAD computer program (Table 1) . The 
K z values all were similar and in the low nM range 
whereas the B max for CrylAc was higher than CrylAa or 
CrylAb. The Hill coefficients for CrylAa, CrylAb and 
CrylAc were 0.65, 0.65, and 0.77, respectively, 
indicating a negative binding cooperat ivity for the 
toxins against the BBMV proteins. A single binding 
site model was indicated based on the nonlinear 
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regression analysis for both CrylAa and CrylAb. 
Significantly, CrylAc, the data was best accommodated 
by a two binding site model with high- and low-affinity 
binding sites. 

Results: Specificity of 125 i-Cry1Ac Toxin Binding in Ligand Blots. 

In view of the putative "two-binding site" model 
predicted for the CrylAc toxin, radioligand blots of P. 
gossypiella BBMV proteins were carried out with :25 I- 
CrylAc toxin in the presence of increasing 
concentrations of unlabeled CrylAc toxin. 
Autoradiography of these blots revealed significant 
reduction in the intensity of the 200 kD band (data not 
shown) . Indeed, it was undetectable at a CrylAc toxin 
concentration of 10 nM. In the case of the 120 kD 
band, however, there was virtually no reduction in the 
band intensity (data not shown) even at a CrylAc 
concentration of 1000 nM. In saturation binding 
assays, incubation of a fixed amount of each of the 
three 125 I -labeled CrylA toxins with increasing 
concentrations of BBMV showed that binding reached a 
saturation level in each case but that the level of 
CrylAc binding was substantially higher than those of 
CrylAa and CrylAb. Maximum saturable binding at 4 00 
pig/ml of BBMV was approximately 0.35, 0.05 and 1.5 ng 
for CrylAa, CrylAb and CrylAc, respectively, which 
represents an approximately 30-fold difference in 
CrylAc binding compared to CrylAb, and, it is 4 fold 
higher for CrylAc compared to CrylAa (data not shown) . 

Results: Immunoprecipitation of the CryIAb Binding Protein. 

Immunoprecipitation experiments were performed 
using CrylAb, which has the highest binding affinity of 
the three toxins, to further examine the specificity of 
binding of the toxin to" the 200 kD protein. 33MV 
proteins were solubilized in 1% NONIDET P-40™ and 
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immunoprecipitated with anti-toxin-protein A-Sepharose 
beads. The mixture of bound material was solubilized 
in SDS sample buffer containing 2 -mercaptoethanol . 
Electrophoresis and staining of the gel with Coomassie 
5 blue revealed a protein of about 200 kDa , demonstrating 

selective precipitation of the 200 kD toxin-binding 
protein. Radioligand blotting with 125 I-CrylAb showed 
a band of about 2 00 kDa (data not shown) , indicating 
precipitation of the same binding protein as that 
10 identified in previous ligand blot experiments. 

Additionally, a Western blot (data not shown) of the 
immunoprecipitated protein using CrylAb and anti-CrylAb 
polyclonal antiserum confirmed the results of the 
radio-ligand blot (data not shown) . The low-molecular 
15 weight bands at 6 0 and 52 kDa correspond to the CrylAb 

toxin and the heavy chain of IgG, respectively. 

Results : Purification of the Binding Proteins . 

To determine whether the 2 00 kD band contains more 
than one protein, the band was excised from a 7.5% SDS 
polyacrylamide gel, electroeluted , dialyzed and 
concentrated. The concentrated protein was analyzed by 
two-dimensional gel electrophoresis over a pH range, of 
3.5-10. The protein migrated as one spot with an 
estimated pi of 4.5 ± 0.2 and apparent molecular mass 
of 200 kDa. The purified 200 kD protein stained with 
Schiff's reagent (data not shown) indicating that the 
binding protein is glycosylated. The 200 kD IEF spot 
bound l2z} I -CrylAb (data not shown) corroborates the 
results from other immuncprecipi tat ion studies. 

3 0 Results : Southern Blot Analysis . 

To detect the presence of the CrylA receptor in P. 
gossypiella, genomic DNA from both insects were 
hybridized against the cloned THW BT-R, cDNA and its 
507-bp minimum binding fragment. The two probes bound 
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intensively to the PvuH fragment of M. sexta genomic 
DNA (data not shown) . There was weak hybridization to 
the P. gossypiella DNA, however, using the minimum 
binding probe and none with the full-length BT-R T probe 
(data not shown) . These results suggest that the 
minimum binding fragment from M. sexta shares a 
significant level of nucleotide similarity to the CrylA 
binding receptor in P. gossypiella, more so than to the 
full-length BT-R : receptor. 

Results : Immunodetection of Native and Cloned Pbw BT-R 2 Using BT- 
R x Antibodies . 

To confirm the relatedness of the cloned PBW 
fragment to the THW 3T-R- and its ability to bind 
toxin, it was subcloned into a pET30 expression vector. 
The native PBW BBMV proteins and the expressed proteins 
from clones PBW-287, -421 and -866 were resolved by 
SDS-PAGE, transferred to a PVDF membrane and incubated 
with either anti-BT-R : serum or CrylAb toxin followed 
by antiserum to the toxin. The results reveal that 
3BMV contain a 200 kD protein that interacts with THW 
3T-R-, antiserum (data not shown) . In addition, clones 
PBW-287, -421 and -866 which express proteins of about 
15, 21 and 32 kD, respectively, also cross - reacted with 
BT-R: antiserum. The 32 kD clone, however, was the 
only protein to bind toxin, whereas no detectable 
binding was observed with the 21 kD protein (data not 
shown) . These results confirm the sequence relatedness 
of PBW BT-R 2 to THW BT-R-. and demonstrate that the 32 
kD protein contains the toxin-binding site of the 
receptor . 

Results: Specificity of Toxin Binding to the Cloned Receptor. 

The specificity and affinity of toxin binding to 
the receptor- fragment (PBW-866) was determined using 
competition ligand blot analysis. The expressed 32 kD 
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protein was transferred to PVDF membranes and 
incubated with 125 I-CrylAb in the absence or presence of 
increasing concentrations of unlabeled CrylAb toxin. 
Autoradiography revealed significant reduction in the 
5 intensity of the 32 kD band to an undetectable level in 

the presence of 500 nM unlabeled CrylAb toxin (data not 
shown) . Bound 12d I toxin was quantitated with a gamma 
counter and the BIO-RAD IMAGER™ analysis system was 
used to calculate the binding affinity of toxin to the 

10 expressed fragment. The binding affinity (~17nM) of 

the toxin was similar to the calculated value (Table 1) 
for BBMV. These results demonstrate that CrylAb binds 
specifically with high affinity to PBW BT-R 2 866. 
Other truncation fragments were also tested, and it was 

15 determined that the minimum binding fragment consists 

of amino acids 1269 to 1367. 

Results: Expression of PBW BT-R 2 in COS-7 Cells. 

PBW BT-R-- cDNA was subcloned into the mammalian 
expression vector pcDNA3 . 1 ( INVITROGEN™ ) and 

20 transfected into COS-7 cells. Protein encoded by the 

PBW BT-R 2 cDNA was expressed as a membrane protein 
capable of binding CrylAb toxin. Membranes isolated 
from transiently transfected COS-7 cells were 
solubilized, electrophoresed, and immunoblotted either 

25 with CrylAb toxin and its antiserum or with BT-R^ 

antiserum directly. The expressed 220 kD receptor 
bound CrylAb toxin and cross-reacted with BT-R, 
antiserum. No interaction to vector transfected cells 
was observed. 

30 Expression of 3T-R 2 receptor on the cell surface 

was shown by fixing the cells in methanol or 
paraformaldehyde and incubating first with anti-BT-R, 
serum, and then with TRITC IgG secondary antibodies. 
Transfected cells portrayed bright surfaces due to the 

35 binding of BT-R : antibodies to the cell surface clearly 
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showing that the PBW BT-R, receptor is expressed on the 
cell surface. 

The surface-expressed PBW receptor binds toxin and 
kills the cells. Transfected cells were incubated with 
CrylAb toxin for 2 or 4 hr, washed, fixed and incubated 
first with anti-CrylAb antiserum, and then with TRITC 
IgG secondary antibodies. As shown by 

immunofluorescence microscopy, BT-R, expressing COS-7 
cells bound the toxin, whereas cells transfected with 
vector alone did not show any surface binding of toxin. 
Incubation of cells expressing PBW BT-R 2 with toxin for 
2 or 4 hr showed significant morphological changes 
which include loss of cell integrity, loss of cell 
cytoplasm and complete disintegration of the plasma 
membrane and cell death. 
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WHAT IS CLAIMED IS: 



1 1 . An isolated nucleic acid molecule, comprising 

2 a sequence encoding a BT toxin receptor of about 200 kD 

3 from the pink bollworm, Pectinophora gossypiella. 

1 2. The isolated nucleic acid molecule of claim 

2 1, encoding the BT toxin receptor sequence of SEQ ID 

3 NO : 2 . 

1 3. The isolated nucleic acid molecule of claim 

2 1, comprising the sequence of SEQ ID NO: 1. 

1 4. The isolated nucleic acid molecule of claim 

2 1, wherein said isolated nucleic acid molecule is 

3 capable of hybridizing at high stringency to a probe of 

4 400 contiguous nucleotides of SEQ ID NO: 1 over the 

5 entire length of said probe. 

1 5. The isolated nucleic acid molecule of claim 

2 1, wherein said BT toxin receptor is at least 85% 

3 homologous to the sequence of SEQ ID NO: 2. 

1 6. The isolated nucleic acid molecule of claim 

2 1, wherein said BT toxin receptor is at least 90% 

3 homologous to the sequence of SEQ ID NO: 2. 

1 7. The isolated nucleic acid molecule of claim 

2 1, wherein said BT toxin receptor is at least 95% 

3 homologous to the sequence of SEQ ID NO: 2. 

1 8. The isolated nucleic acid molecule of claim 

2 1, wherein said BT toxin receptor is at least 98% 

3 homologous to the sequence of SEQ ID NO: 2. 
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1 9. The isolated nucleic acid molecule of claim 

2 1, wherein the sequence is codon optimized for 

3 expression in a prokaryotic system. 

1 10. The isolated polynucleotide sequence of claim 

2 1 wherein the sequence is codon optimized for 

3 expression in a eukaryotic system. 

1 11. An expression vector, comprising the isolated 

2 nucleic acid molecule of claims 1-10. 

1 12 . A host cell, comprising the vector of claim 

2 11. 

1 13. The host cell of claim 12, which is used to 

2 assess the level of cytotoxicity and mode of action of 

3 toxins. 

1 14 . A cell, comprising a naturally occurring BT 

2 toxin receptor, which is used to assess the level of 

3 cytotoxicity and mode of action of toxins. 

1 15. A transgenic organism, comprising the vector 

2 of claim 11. 

1 16. An isolated nucleic acid molecule, comprising 

2 a sequence encoding a peptide selected from the group 

3 consisting of: amino acid (aa) 534-544, aa 291-304, aa 

4 697-705, aa 622-632, aa 886-895, aa 791-803, aa 1055- 

5 1066, aa 1621-1642, aa 1321-1331, aa 1451-1461, aa 

6 1516-1525, aa 1572-1582, aa 1677-1729, and aa 1269-1367 

7 of SEQ ID NO: 2. 

1 17. A purified protein, comprising a sequence 

2 that is at least 85% homologous to SEQ ID NO: 2: 
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1 18. The purified protein of claim 17, wherein the 

2 sequence is at least 90% homologous to SEQ ID NO: 2. 

1 19. The purified protein of claim 17, wherein the 

2 sequence is at least 95% homologous to SEQ ID NO: 2. 

1 20. The purified protein of claim 17, wherein the 

2 sequence is at least 98% homologous to SEQ ID NO: 2. 

1 21. The purified protein of claim 17, wherein the 

2 sequence is SEQ ID NO: 2. 

1 22 . A purified peptide, comprising a sequence 

2 selected from the group consisting of; amino acid (aa) 

3 534-544; aa 291-304; aa 697-705; aa 622-632; aa 386- 

4 895; aa 791-803; aa 1055-1066; aa 1621-1642; aa 1321- 

5 1331; aa 1451-1461; aa 1516-1525; aa 1572-1582, aa 

6 1677-1729; and aa 1269-1367 of SEQ ID NO: 2. 

1 23. The purified peptide of claim 22, that 

2 comprises aa 1269-1367 of SEQ ID NO: 2. 

1 24. A purified peptide that comprises at least 17 

2 contiguous amino acids (aa) from aa 1677-1729 of SEQ ID 

3 NO : 2 . 

1 25. An isolated nucleic acid molecule, comprising 

2 a first sequence that is capable of hybridizing at high 

3 stringency to a probe of a second sequence along said 

4 probes entire length, wherein said second sequence is 

5 nucleotides 3807-4101 of SEQ ID NO: 1. 

1 26. An isolated nucleic acid molecule , comprising 

2 a sequence that encodes the peptide of claims 19-21. 
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1 27. Bacillus thuringiensi s Cry toxin receptor 

2 antibody, that binds to an antigen present in the 

3 carboxyl tail of a BT-R2 protein as identified in SEQ 

4 ID NO: 2, and does not bind to silkworm or tobacco 

5 hornworm Bacillus thuringiensis Cry toxin receptors. 

1 28. A Bacillus thuringiensis Cry toxin receptor 

2 antibody, that binds to an antigen present in a BT-R2 

3 protein as identified in SEQ ID NO: 2, and also binds 

4 to silkworm and tobacco hornworm Bacillus thuringiensis 

5 Cry toxin receptors. 
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AACATTTACATACAGCCAGTGTAGATGACACATTGATTTAAAAAAAATAGTGCGAGTGCTTTGA 

ATCTGTGATTTCAAATATCGAATCAAAAGGACTGCATTAGTGTTGTGGGAGTTAAAGTGTTTGT 

GAGAATAGACCAACGACCATGCAAGATGGCGGGTGACGCCTGCATACTGGTGACGGTGCTTCTC 

ACCTTCGCAACATCAGTTTTCGGGCAAGAAACAACATCGTCGAGATGTTACTACATGACTGACG 

CTATTCCGAGGGAACCGAAACCGGATGATTTGCCTGACTTAGAATGGACTGGTGGATGGACCGA 

CTGGCCTTTGATCCCGGCTGAGCCAAGAGACGACGTGTGCATAAACGGCTGGTACCCACAACTC 

ACCAGCACTTCTCTCGGCACCATCATCATCCACATGGAAGAGGAGATCGAGGGAGATGTTGCTA 

TCGCTAAACTTAACTATGATGGTTCTGGAACCCCAGAAATTGTCCAGCCGATGGTTATAGGATC 

TTCTAACCTGCTAAGTCCAGAGATCCGGAATGAAAACGGGGCGTGGTACCTTTATATAACCAAT 

AGGCAAGATTATGAAACACCAACAATGCGTCGGTATACATTCGACGTCCGAGTGCCAGACGAGA 

CTCGTGCGGCACGAGTGAGTCTGTCCATCGAAAACATTGACGATAACGACCCTATCGTCAGGGT 

GCTAGACGCTTGCCAAGTGCCGGAATTGGGGGAGCCTCGACTAACAGACTGCGTTTACCAAGTG 

TCAGACGAAGATGGGAGGCTTAGTATCGAGCCCATGACATTCCGCCTCACATCAGACCGTGAAG 

ACGTACAGATATTCTATGTGGAGCCAGCTCACATTACTGGTGATTGGTTCAACATGCAAATTAC 

TATCGGTATCCTATCAGCGCTTAACTTCGAAAGCAACCCGCTGCACATCTTTCAAATCACTGCT 

TTGGACTCCTGGCCCAACAACCATACGGTGACGGTGATGGTGCAAGTCCAGAATGTGGAACACC 

GACCGCCGCGATGGATGGAAATCTTCGCAGTCCAGCAGTTTGACGAGATGACGGAGCAGCAATT 

CCAGGTGCGCGCCATCGACGGAGACACTGGCATCGGGAAAGCTATACACTATACCCTCGAGACA 

GATGAGGAAGAAGATTTGTTCTTCATCGAAACACTTCCGGGCGGCCATGACGGAGCCATCTTCA 

GCACTGCCATGATTGATGTGGATAGGCTCCGGCGAGATGTCTTCAGACTGTCCCTGGTGGCATA 

CAAGTACGACAATGTGTCCTTCGCCACCCCGACACCCGTCGTGATCATAGTCAATGACATCAAC 

AACAAGAAACCCCAACCGCTGCAAGATGAGTACACAATCTCCATAATGGAAGAAACTCCACTGT 

CGCTGAATTTTGCTGAACTTTTTGGTTTCTATGATGAAGATTTGATCTACGCACAATCCTTGGT 

GGAAATACAAGGCGAGAACCCTCCAGGCGTAGAGCAAGCGTTTTATATTGCGCCCACCGCAGGC 

TTCCAGAACCAGACATTCGCCATAGGGACTCAAGATCACCGAATGCTGGATTATGAGGATGTTC 

CTTTCCAAAACATCAAGCTCAAGGTAATAGCAACGGACCGTGACAATACCAATTTTACTGGAGT 

CGCGGAAGTCAACGTGAACCTGATTAATTGGAACGACGAGGAGCCGATCTTTGAGGAAGACCAG 

CTCGTTGTCAAGTTCAAGGAGACTGTACCCAAGGACTATCACGTCGGCAGACTGAGGGCTCACG 

ACCGGGACATAGGAGACAGCGTTGTGCATTCCATCTTGGGAAATGCGAATACATTTTTGAGAAT 

CGACGAAGAAACTGGCGACATATACGTAGCTATTGATGACGCGTTCGATTATCACAGACAGAAT 

GAATTTAACATACAAGTTCGCGCTCAGGACACCATGTCGGAGCCAGAGTCCAGGCATACAGCGG 

CTGCTCAGCTGGTCATAGAACTCGAGGACGTCAACAACACACCTCCTACTCTGAGGCTGCCTCG 

CGTAAGTCCGTCTGTAGAAGAGAATGTGCCAGAGGGCTTTGAAATCAACCGGGAGATAACCGCC 

ACGGACCCTGACACCACAGCATACCTGCAGTTTGAAATAGATTGGGACACATCCTTTGCCACTA 

AACAGGGGCGTGATACCAATCCAATAGAGTTCCACGGATGCGTGGATATAGAAACCATCTTCCC 

AAACCCAGCCGACACCAGAGAGGCTGTGGGGCGAGTGGTAGCGAAGGGGATCCGCCATAACGTG 

ACCATCCATTTTGAAGAGTTTGAATTTCTCTACCTCACAGTGAGAGTTCGGGACTTGCACACAG 

ATGACGGACGAGATTATGATGAATCTACCTTCACGGTAATAATAATAGATATGAACGACAACTG 

GCCTATCTGGGCGTCTGGTTTCCTGAACCAGACCTTCAGTATTCGGGAGCGATCATCTACCGGC 

GTCGTCATCGGGTCCGTACTCGCTACAGACATTGATGGCCCACTTTACAACCAAGTCCGGTACA 

CCATTATCCCCCAGGAAGATACTCCTGAAGGTCTAGTCCAGATACATTTCGTTACGGGTCAAAT 

TACAGTTGATGAGAATGGTGCAATCGACGCTGATATTCCACCTCGTTGGCACCTCAACTACACG 

GTTATAGCCAGCGACAAATGTTCTGAAGAAAATGAAGAGAACTGTCCCCCGGATCCAGTGTTCT 

GGGATACTCTGCGCGACAATGTAATTAACAT CGTGGACATAAACAACAAGGTCCCGGCAGCAGA 

CCTCAGTCGATTCAACGAAACGGTGTACATTTATGAAAATGCACCCGATTTCACGAACGTGGTC 
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AAGATATACTCCATCGACGAAGACAGAGACGAAATATATCACACGGTGCGGTACCAGATCAATT 
ATGCTGTGAACCAACGGCTGCGAGACTTCTTCGCCATAGACCTGGATTCAGGCCAGGTGTACGT 
GGAGAACACCAACAATGAGCTCCTGGATCGGGACAGAGGCGAAGACCAACACAGGATATTCATT 
AACCTCATTGACAACTTTTATAGCGAAGGAGATGGA^TAGAAATGTAAACACTACAGAGGTGC 
TGGTGATACTATTAGATGAGAATGACAACGCTCCTGAATTGCCGACTCCAGAAGAGCTGAGTTG 
GAGCATTTCCGAGGATTTACAAGAGGGTATAACACTCGATGGCGAAAGCGATGTGATATACGCA 
CCGGATATAGACAAAGAGGACACGCCAAACTCTCACGTTGGCTACGCAATCCTGGCCATGACAG 
TCACCAATAGAGACCTGGACACTGTTCCGAGACTTCTCAACATGCTGTCGCCTAACAACGTAAC 
CGGATTCCTCCAGACAGCAATGCCTTTGAGAGGATATTGGGGGACTTACGATATAAGTGTACTG 
GCGTTCGACCACGGTATTCCTCAGCAGATATCTCATGAGGTGTATGAATTGGAAATTCGACCTT 
ACAATTACAATCCTCCTCAGTTCGTTTTTCCTGAATCCGGGACGATTCTACGACTGGCTTTGGA 
ACGCGCAGTGGTAAATAATGTTTTGTCACTTGTAAACGGTGACCCGTTAGACAGGATACAAGCA 
ATTGACGACGATGGTCTTGATGCTGGCGTGGTGACTTTCGATATTGTTGGAGATGCTGATGCGT 

caaactacttcagagtaaataatgatggcgacagctttggaaccttgttgctgacacag'gcgct 

TCCTGAGGAAGGCAAGGAATTTGAGGTTACCATCCGGGCTACAGACGGCGGAACAGAACCTCGA 

TCATATTCAACAGACTCCACTATAACAGTCCTCTTCGTTCCGACTTTGGGTGATCCGATCTTTC 

AAGATAACACTTACTCAGTAGCATTCTTTGAAAAAGAGGTTGGCTTGACTGAGAGGTTCTCGCT 

CCCACATGCAGAGGACCCTAAGAACAAACTCTGCACTGACGACTGTCACGATATTTACTACAGG 

ATCTTTGGTGGTGTGGATTACGAGCCATTTGACCTGGACCCGGTGACGAACGTGATCTTCCTGA 

AATCAGAACTAGACCGGGAGACCACTGCTACGCATGTGGTGCAAGTGGCAGCCAGTAATTCGCC 

CACAGGAGGCGGAATACCACTCCCTGGGTCTCTTCTCACCGTCACTGTCACTGTACGAGAAGCG 

GATCCACGGCCTGTGTTCGAGCAGCGTCTGTACACGGCTGGCATTTCCACTTCCGATAACATCA 

ACAGGGAACTACTCACCGTTCGTGCAACTCATTCCGAAAACGCACAATTGACATATACCATCGA 

AGACGGTTCTATGGCGGTGGACTCCACTCTGGAAGCCGTCAAGGACTCGGCGTTCCATCTGAAC 

GCGCAGACCGGCGTCCTCATACTGAGGATACAACCTACTGCCAGCATGCAGGGCATGTTTGAGT 

TCAACGTCATCGCTACTGACCCAGATGAGAAGACAGATACGGCAGAGGTGAAAGTCTACCTCAT 

TTCATCCCAAAATAGGGTGTCCTTCATATTCCTGAACGATGTGGAGACGGTTGAGAGTAACAGA 

GACTTTATCGCAGAAACGTTCAGCGTTGGCTTCAACATGACCTGCAATATAGATCAGGTGCTGC 

CGGGCACCAACGACGCCGGGGTGATTCAGGAGGCCATGGCGGAAGTCCATGCTCACTTCATACA 

GGATAACATCCCTGTGAGCGCCGACAGTATTGAAGAGCTTCGCAGTGACACTCAGCTGCTGCGC 

TCCGTCCAAGGTGTGTTGAACCAACGGCTGTTGGTCCTGAACGACCTGGTGACGGGGGTCAGCC 

CTGATCTCGGCACTGCCGGCGTGCAGATCACCATCTATGTGCTAGCCGGGTTGTCAGCCATCCT 

TGCCTTCCTGTGCCTTATTCTGCTCATCACATTCATCGTGAGGACCCGAGCTCTGAACCGCCGT 

TTGGAAGCACTGTCGATGACGAAATACGGCTCGGTGGATTCGGGGCTGAACCGAGTGGGGATAG 

CGGCCCCAGGAACCAACAAACACGCCATCGAAGGCTCCAACCCCATCTGGAACGAGCAGATCAA 

GGCCCCGGACTTCGATGCCATCAGTGACACATCTGACGACTCTGATCTAATCGGCATCGAGGAT 

AGCCTGCAGGGAGACTTAGAAGAGAAAAGGGCAGACAAAGCAGTAGATGCCTTGGTGAAAAAGC 

TGAAGAAGAACGATGGAGCCATGGGGGAATACGAATTCAAGGCCTCTCGAGCCTCTAGAACTAT 

CGTGAGTCGTATTACGTATATCCAGACATGATGAGATACATTGATGAGTTTGGACAAACCGCAA 

CTAGAATGCAGTGAAAAAAATGCTTTATTTGTTGAAATTTGTGATGCTATTGCTTTATTTGGAA 

CCATTATAAGCTGCAATAAACAAGT-TAACATCATCAATTGCATTCATTTTATGTTTCAGGTTCA 

GGGGGAGGTGTGGGAGGCTATCC 
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SIG 

1 MAGDACILVT VLLTFATSVF 

51 TGGWTDWPLI PAEPRDDVCI 

101 KLNYDGSGTP EIVQPMVIGS 

151 RRYTFDVRVP DETRAARVSL 

2 01 CVYQVSDEDG RLSIEPMTFR 
251 ILSALNFESN PLHIFQITAL 

3 01 VQQFDEMTEQ QFQVRAIDGD 

351 GAI FSTAMID VDRLRRDVFR 
CR4 

4 01 PQPLQDEYTI SIMEETPLSL 

4 51 EQAFYIAPTA GFQNQTFAIG 

501 FTGVAEVNVN LINWNDEEPI 

551 GDSWHSILG NANTFLRIDE 

601 MSEPESRHTA AAQLVIELED 

651 ITATDPDTTA YLQFEIDWDT 

701 TREAVGRWA KGIRHNVTIH 

751 viiidmndnw PIWASGFLNQ 

801 RYTIIPQEDT PEGLVQIHFV 

851 KCSEENEENC PPDPVFWDTL 

901 NAPDFTNWK I YSIDEDRDE 

951 YVENTNNELL DRDRGEDQHR 

CR9 

1001 ENDNAPELPT PEELSWSISE 
1051 GYAILAMTVT NRDLDTVPRL 
1101 LAFDHGIPQQ" ISHEVYELEI 



3/9 

GQETTSSRCY YMTDAIPREP KPDDLPDLEW 
CR1 -> 

NGWYPQLTST SLGTIIIHME EEIEGDVAIA 

SNLLSPEIRN ENGAWYLYIT NRQDYETPTM 

CR2 -+ 

SIENIDDNDP IVRVLDACQV PELGEPRLTD 

LTSDREDVQI FYVEPAHITG DWFNMQITIG 

CR3 — 

DSWPNNHTVT VMVQVQNVEH RPPRWMEIFA 

TGIGKAIHYT LETDEEEDLF FIETLPGGHD 

LSLVAYKYDN VSFATPTPW IIVNDINNKK 

NFAELFGFYD EDLI YAQSLV EIQGENPPGV 

TQDHRMLDYE DVPFQNIKLK VIATDRDNTN 
CR5 — 

FEEDQLWKF KETVPKDYHV GRLRAHDRDI 

ETGDIYVAID DAFDYHRQNE FNIQVRAQDT 
CR6 — 

VNNTPPTLRL PRVSPSVEEN VPEGFEINRE 

SFATKQGRDT NPIEFHGCVD IETIFPNPAD 

FEEFEFLYLT VRVRDLHTDD GRDYDESTFT 
CR7 -* 

TFSIRERSST GWIGSVLAT DIDGPLYNQV 

TGQITVDENG AIDADIPPRW HLNYTVIASD 

CR8 -> 

RDNVINIVDI NNKVPAADLS RFNETVYIYE 
IYHTVRYQIN YAVNQRLRDF FAIDLDSGQV 
IFINLIDNFY SEGDGNRNVN TTEVLVILLD 
DLQEGITLDG ESDVIYAPD-I DKEDTPNSHV 
LNMLS PNNVT GFLQTAMPLR GYWGTYDISV 
RPYNYNPPQF VFPESGTILR LALERAWNN 
Fieure 2A 
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CR10 — 

1151 VLSLVNGDPL DRIQAIDDDG LDAGWTFDI VGDADASNYF RVNNDGDSFG 

1201 TLLLTQALPE EGKEFEVTIR ATDGGTEPRS YSTDSTITVL FVPTLGDPIF 

CR11-* MBF 
1251 QDNTYSVAFF EKEVGLTERF SLPHA EDPKN KLCTDDCHDI YYRIFGGVDY 

1301 EPFDLDPVTN VIFLKSELDR E TTATHWO V AASNSPTGGG IPLPGSLLTV 

CR12-> ~ 

13 51 TVTVREAD PR PVFEORLYTA GISTSDNINR ELLTVRATHS ENAQLTYTIE 

14 01 DGSMAVDSTL EAVKDSAFHL NAQTGVLILR IQPTASMQGM FEFNVIATDP 

f — MPD 

1451 DEKTDTAEVK VYLISSQNRV SFIFLNDVET VESNRDFIAE TFSVGFNMTC 

LZ 

1501 NIDQVLPGTN DAGVIQEAMA EVHAHFIQDN IPVSADSIEE LRSDTQLLRS 

1551 VQGVLMQRLL VLNDLVTGVS PDLGTAGVQI TIYVLAGLSA ILAFLCLILL 
— CYT 

1601 ITFIVRTRAL NRRLEALSMT KYGSVDSGLN RVGIAAPGTN KHAIEGSNPI 
1651 WNEQIKAPDF DAISDTSDDS DLIGIEDSLQ GDLEEKRADK AVDALVKKLK 
17 01 KNDGAMGEYE FKASRASRTI VSRITYIQT. 

Figure 2B 
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Figure 5A 
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Figure 5B 
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Figure 5C 
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AAC All T AC ATACAGC CAG T'G TAG AT GACACAT TGATT T AAAAAAAATAGT GC GAG TGCT T'T GA 
ATCTGTGAT TTCAAATATCGAATC AAAAGGACT GC ATT AGTGTTG TGGG AGTT AAAGT GT TT G T 
G AG AAT AGACCAACG ACC ATG CAAGATG GCGGG TGACG CCTGCATAC T'G GT GACG GTGCT PCTC 
ACCTTCGCAACATCAG T'TT TC GG GCAAGAAACAACATC G TCGAGAT'G T'T AC TACATGACT GACG 
C TAT T'CCGAGGGAAC CGAAAC CGGATGATTTGCCT GACT TAGAATGGAC T'G GT GG ATG GACCGA 
C 7GG CCTT T GATCCCGGCTGAGCCAAGAGACGACG TGT GCATAAACGGC TG GT ACCCACAAC TC 
ACCAGCACT TCTC TCGGC ACC ATCAT CATCC AC AT GGAAG AGGAG ATCGAG GGAGAT G TT GC T A 
T CGC TAAAC 7TAACTATGATG GT TCTGG AACCCCAGAAA T TG TC CAGCC GAT'G GT TAT AG GATC 
T 7CTAACCT GCTAAG TCCAGAGATC CGGAATGAAAAC G GG GC GTGGT AC CT TT AT AT AACCAAT 
AGGC AAG AT TATG AAACACCAACAATGC GTCGG TATAC AT TC GACG7CC GAGTSCCAGACGAGA 
C TCG TG CGG CACGAG TGAG TC TG TCC AT CGAAAAC ATT GACG ATAACGACC CTATCG TC AGGGT 
GCTAGACGC TTGCCAAGT GCCGG AAT TG GGGC-AGC CTC GACT AACAGAC TG CG TT TACCAAG TG 
T C AG AC G AAG AT6GG AGG C TT AG TATCG AGCC C ATG AC AT TC CGCC TCACATC AGAC CG TGAAG 

ACGTACAGATATTCTATGTGGAGCCAGCTCACAT'TACTGGTGATTGGTTCAACATGCAAATTAC 
TATC GGTAT CCTA'T'C AGC GCT TAAC T TC GAAAGCAAC CCG CT GCACATC TT TCAAAT CAC TGCT 
TTGG ACTCC TGGCCCAACAACCATACGG TGACGGTGATGG TGCAAGTCC AG AATG TG GAACACC 
GACC GCCGC GAT GGATGG AAATC TT CGC AG TCCAGCAG TT TGACG AG AT GACG GAGC AGCAATT 
CC AG GTGCG CGCCATCGACGGAG ACAC TGG CATC GGGAAAGC TATAC AC TATAC OCT CGAGACA 
GATC AGGAAGAAGAT TTGTTC TTCATC-GAAACAC T TC CGGGC GGCC ATG AC GGAGCCAT C TTCA 
GC AC TGCCATGATTG ATG TGG AT AG GC TCCGG CGAGATGTCT TC AGACT GT CCCTGG TG GCATA 
CAAG TACGACAATGTGT C CT TCGCCAC CCCGACACC C G TCGT GATCATAGTCAATGACATC? AC 
AACAAGAAACC C CAACC GCT GCAAG ATGAG TACACAATCTCC ATAATGG AAGAAACTCC ACTGT 
CGCT GAATT TTGCT GAAC TT TTTGG TT TCTATGATGAAGATT TG'ATCTACG CACAATCC TTGGT 
GGAAATACAAGGCGAGAACC CTCCAGG CGTAGAG CAAGCG TT TT ATATT GCGCCCAC CG CAGGC 
TTCC AGAACCAGAC ATT CGC CATAGGGACTCAAGATCACCGAAT GCTGG AT TATGAG GATGT TC 
CT TT CCAAAACATCAAG C TCAAGGTAATAGCAAC GGACCG TG AC AATACCAAT TT TACT GGAGT 
CGCG GAAG TCAACG TGAACC TGA'TTAATT GGAAC GAC SAGGAGCCGATC TT TG AGGAAG AC CAG 
CTCG TT GT C A^G T TCAAGGA-3ACTG TACC CAAGGAC TATCACGT CGGCAGACTGAGGGC TCACG 
ACCG GGAC ATAGGA'GAC AGCGT TG TGCAT TCCATCT TGG GAAAT GC GAATACATTTT TGAGAAT 
C GAC G AAG AAACT GGCGACATATACGT AG CTATTGATGACGCGT TCGAT TATCACAGACAGAAT 
GAAT TT AACATAC AAG T TC GC GCT CAGGACACCATG TCG GAGCC AGAGT CC AGGCATA.C AG CGG 
C TGC TC AG CT GGTCAT AGAACTCGAG GACGT CAACAACACACCT CCTAC TC TGAGGCTG CC TCG 
CGTAAG TC CGTCTGTAGAAGAGAATG TGCCAGAGGG C T'T TGAAATCAACCGG'GAGATAACCGC C . 
ACG G ACCC TG ACACCACAGCATACCTGCAGT TT GAAATAGATTGGGACACATCC TT TGCCACT A 
AACAGGGGCG TG ATACCLAATCCAATAGAGTTCC AC GGATGC G TGGATAT AG AAACC ATC TTCC C 
AAACCC AGCCGAC ACCAG AGAGGC TG TG GGGCG AG TGG TAG CGAAGGGG ATC CG CCAT'AACGTG 
ACC AT CCATT TT GAAGAG TTT GAATT TC TC7 ACCT CACAGT GAG AGTTCGGGACTT GCACACAG 
ATG ACGGACG AG AT TATG ATG AATCTACCT TCACG G TAATAATAATAGATATGAACGACAACTG 
G CC TATC TGGGCGT C TG G TT TCC TG AACCAGACCT TCAG TATTC GGG AGCG ATCATC TACC GGC 
G TC GTCATC GGG TC CGTACTCGC TACAGACAT TGATGG CCCACT TT ACAACCAAGTCCGGT ACA 
CCATT AT CC CC CAGGAAGAT AC TCC TGAAGGT CTAGTC C AGATACATTTCG TTACG GGTCAAAT 
T AC AG T'T GAT'G AG AAT GGTGCAATCGACG C T'GA'TATTC CACCT CGT TG GCAC CT CAACTACAC G 
G TT AT AGCC AG CGACAAAT Q TT CTG AAGAAAATGAAG AGAACT G T'CCC CC GGATCC AG TGTTC T 
GGG AT AC TC TG CGCG AC AA7GT AATT AAC AT CGTGG ACAT AAACAACAAG GTCCCGGCAGCAGA 
CC 7CAGTC GAT TCAACGAAAC G GT GTACATT TATGAAAATGCACCCG ATT TC ACGAACGTGG TC 
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AAGATATAC TC CATC GACGAAGAC AG AGACGAAAT AT AT CACACGG T'GCG G TACCAGAT GAATT 
ATGC TGTGAAC CAAC GGC TGCGAGAC T T'C T TC GCCATAGAC CT GGAT'T CAGGC CAGGTG TACGT 
GGAG AAGACCftAC AATGAGC ICC TGGATCGGGACAGAGGCGAAGACCAACACAGGATAT TCATT 
AACC TCATTGACAAC TTT TAT'AGCGAAGG AGATCG AAA T AG AAATG TAAACAC TAC AG AGGTGC 
TG GT GATAC TAT TAGATG AG AATGAC AACGCT CC TGAAT TG CCGAC TCC AGAAGAGC TGAGT TG 
GAGCATT TC CGAGGAT T T AC AAG AGG G TATAACAC TC GAT'GGC GAAAGCG AT Q TG ATATACG CA 
CC GGATA TAGAC AAAGAGGACAC GCC AAACTC TCACG TTGGCT ACGCAATCC TGG CCAT G AC AG 
TC ACC A^TAG AG ACCTGG AC ACT GTTCCG AGACT TCT CAftCAT GC TGTC GCC TAACAAC G TAAC 
CG GAT TCCTCC AGACAGC AATGC CTT TGAGAG GATAT TGGGGG AC T TACGAT ATAAG TG TAC TG 
GC G TT CGACCACGG TATTCC TCAGCAG AT ATC TCAT'G AGGTGT AT GAAT TOG AAATTCG AC CTT 
ACAAT TACAATCCTCC TCAG T TC GTT T TT CCT GAATCCGGG AC GATT CTACGACTGGCTTTGGA 
AC GCG C AG TGGTAAATAATG T TT TGT C AC TTG TAAACGG TG AC CC GT TAGAC AGGA 7ACAAGCA 
ATTGACGAC GATGG TC TT GATGC TGG CGTGGT GAC TT TCGATATT GTTGG AGATGCTGATG CGT 
CAftAC TAC TTC AGAGT AAATAAT GAT GG C GAC AG CTT T'GGAAC CT TG TT GOT GACA C AGGC GOT 
TCC TGAGG AAGGCAAGGAAT TTG AGG T TACCATCCGG GC TACAGACGGCGGAACAGAACCT CGA 
TCATATTCAACAGACTCC AC TAT AAC AG T CCT CT T CG TTCCGACTT T GGG TGATCCG AT CT T TC 
AAGATAACACTTACTCAGTAGCATTCTTTGAA*^ 

CCC ACATGCAGAGGACCC fAAGAACAAACTCTGCACTGACGACrGTCACGATATTTACTACAGG 
AT C TT TGG TGGTGTGGAT TACGAGCC ATT TGACCTGGACCCGG TGACGAACGTGATC TT CC TGA 
AAT CAGAAC TAGACCGGGAGAC C ACT GC TAC GCATGT GG TGCAAGTGGCAGC CAG TAAT TCGCC 
C AC AGG AGGCGG AATACC AC TC C CTG GGTCT C TT C TC ACCG TC AC TGTCACTG TACGAGAAGCG 
GAT CC AC GGCCTGTGT TCGAGCAGCG TC TGTAGACGG CTGGCAT TTCC AC T1CCG AT AACATCA 
AC A GGG AAC TAC TCACCG TT CGTGCAACTCAT TC CGAAAACGC ACAAT TGACATATACCATCGA 
AGACGG TTCTATGGC GGTGG ACTCCACT C TG GAA GCCGTCAAGGAC TCGGCG T TCCAT CTCAAC 
G CG CAG ACCGGCGTC C TCAT AC T GAGGAT ACAACC TACTGC CAGCATGCAGGGCATGTT TGAGT 
TCAACG T C ATCGCTACT GACCCAGAT GAGAAi^AC A-GATACG GCAGAGG T'GAAAGT CT AC CTCAT 
T T C ATCCCAAAAT AGGGTGTCCT TCATAT TCC TG AAC GATG TG GA6ACGG TTG AG AG T AAC AGA 
GAC TT TATC GCAG AAACG TTCAGCGT TG GCT TCAACATG ACCT GCAAT AT AGATC AGG TGC TGC 
GGG GCACCAAC GACG CCGGGGT G ATTCAGGAGGC CATGGCG GAAGTCC ATGC TCACT TCAT AC A 
GGATAACAT CC CTGT 6AGCGCCGACAGTATT GAAGAGCT TCGCAGTGACACTCAG CTGCTGC GC 
TCC GTCC AAGG TG TG TTGAACC AAC GGC TGT TGG TCC TGAACGAC C TGGT GACGG GGGT CAG CC 
CTG ATCTCGGC AC TGCC GGCGTGCAGAT CAC CAT CTATG TGCT AGCCGGG TTG TCAG CCATCCT 
TGCCT TC CT GTGCC T TATT CT GCT C ATC ACATTC ATCGTGAGG ACCCG AG CT C TG AACCGCCGT 
TTGGAAGCACTGTCGATGACGAAATACGGCTCGGTGGATTCGGGGCTGAACCGAGTGGGGATAG 
CG GCC CC AGGAACC AACAAACACG CCATCGAAGGCTCCAACCC CATCTGG AACG&GCAG ATCAA 
GG CCCCGGACTT CGATGC CAT CAG TGACAC ATC TGACGACTCT GATCTAATC GGC ATCGAGG AT 
AG CC TGC AGGGAGACT TAGAAGAG AAAAGG GCAG ACAAAGC AG TAG ATGC CTTGG TGAAAAASC 
TGAAGAAGAACGAT GGAGCCATGG GGG AAT ACGAATTCAAG GC C TC TC GAGCC TCTAG AACT AT 
CG TGAGTCGTAT TACG TATATCCAGACATGATGAGATACAT TGATGAG TT TGGACAAACCGCAA 
CT AG AATG C A G T G AAAAAAA T G C T TTATT TG TTG AAA TT TG TGATG CTAT TGC TTTAT TTG GAA 
C CAT TATAAG CTGCAATAAACAAG TT AAC ATCATCAATTGC AT TCATT TTATG TTTCAGG TT CA 
G GGGGAGG TG TGG GAGGC T ATC C 
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1 MAQDACILVT VLLTFATSVP GQETT3SRCY YMYDAIFRBP KFDOLPDLEW 
^ ^ ^ CR1 

5"! TCGWTDWPLI PAEPPJjDVCI ftGWYPQLTST SLGTIIIHME EEIEGPVAIA 

101 KLNYDGS3TF EIVQPMVIGS SMLLSPEIRM EHGAWYLYIT NRQDYETPTO 

CR2 -* 

151 RRYTFDVRVP DETRAARVSL SlENIDDflDP IVRVLDACQV PELGEPRLTD 

201 CVYQVSDEDG RLSIEPMTER LTSDREDVQl FYVEPAHIT6 DWFHMQITIG 

251 ILSALNPESK) PU3IPQITAL DSWPNNHTVT VMVQVQNVEH RPPRWMEIFA 

301 VQQFDEMTEQ QFQVRAIDGD TfllGKAIKYT LfiTDEEEDLP PIETLPGGHD 

251 GAIFSTAMIB VDRLRRDVBR LSLVAYKYDlS VSPATPTPW IIVNDINNKK 
CR4 ~> 

401 PQPLQDEYTI SIMEETPLSL NFAELFGFYD EDLIYAQSLV EIQGENPPGV 

451 EQAFYIAPTA GFQNQTFAIG T QD K R ML D Y E DVFFQNlKLK VIATDRDNTff 

CR5 -* 

501 FTGVAEVHVN LIKWHDEEFI FEEDQLWKF KETVPKDYHV GRLRAHDRDI 

551 GDSWHSILG NAMTFLRIDE E7GDIYVAID D AF D Y HR QNE FNI QVRAQDT 

CR6 -+ 

601 KSEFESRHTA AAQLVIELED VHNTPFTLRL PRVSPSVBEN VPEQFEINRE 

£51 ITATDPDTTA YLQFEIDWDT SFATKQGRDT NFIEFHGCVD IETIFPfiTPAD 

701 TREAVGRWA KGIRHfiVTIH FEEFEFLYLT VRVRDLRTDD GRDYDESTFT 

CR7 

751 VII3DMSENW PIWASGPLffQ TFSIRERSST GWIGSVLAT DIDGFLYNQV 

801 RYTIIPQEDT PEGLVQ1HFV TGQITVDENG AIDADIPPRW KL H Y TV I AS D 

CR8 

S51 KCSEEMEENC PPDPVFWDTL RDNVINIVD3 HNKVPAADLS RFNETVYIYE 

9D1 NAPDFTNVVK IYSIDEDRDE IYHTVRYQIH YAVNQRLRDF PAI DLDSGQV 

B 51 YVEHTMNELL DRDRGEDQKR IFIMLIDNFY SEGDGNRUVN TTEVLVILLD 

CR9 -* 

1091 EiroriAPELPT PEELSWSISE dlqegitldg esdviyapdi dkedtpnshv 

1051 GYAILAMSTVT NEDLDTVPRL. DRMLSPNfJVT GFLQTAMPLR GYWGTYDISV 
11 Dl LAPDHGIPQQ ISHEVYELEI RPYNYMPPQF VPPESGTILR LALERAWHN 

Ffgure 1A 
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CR10 -t 

1151 VLSLVNGDPL DRIQAIDDDG LDAQVVTFDI. VflDADASMYF RVHNDGDSFG 

1201 TLLLTOALPB EGKEFEVTIR ATDGGTEFRS YSTDSTITVL FVPTLGDPIF 

CR11-* MBF 
1251 QDMTYSVAFF BKEVGLTE RF SLFHABPPEN KLCTODCKDI YYRIFtigVDY 

1301 EPFDLDPVTN VIFLKSELDR ETTATHWOV AASHSF TGGG IPLPGSLLTV 

CR12-* 

1351 TVTVREADFR PVFEQRLYT A GISTSDNINR ELLTVRATHS ENAQLTYTIE 

1401 DGSMAVDSTL EAVKDfiAFHL NAQTGVLILR IQPTASMQGM FE FHVI ATDP 

MFD 

1451 DEKTDTAEVK VYL3SSQHRV SFIFLNDVET VESHRDFIAE TFSVGFHMTC 

LZ 

1501 NIDQVLPGTN DAGVIQEAMA EVHAHF3QDN IPVSADSIEE LRSDTOLLES 
IB 51 VQGVUJORIh VLNDLVTGVS PDLGTAGVOI TIYVLAGLgA ILAFLCLILL, 
1601 ITPIVRTRAL HSRLEALSMT KYGSVDSGLN RVGIAAPGTN KHAIEGSNPI 
1651 WNEQIKAPDF DAISDTSDDS DLIG3EDSLQ GDLEEKRADK AVDALVKKLK 
1701 KNDGAMQEYE FKASRASRTI VSRITYIQT. 
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Figure 5A 
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